feat: EXC-1735: Move scheduling into the inner round #1757

berestovskyy · 2024-09-30T19:49:33Z

By moving the scheduling strategy into the inner round, we can adjust canister priorities within each round. This allows for greater flexibility and responsiveness to changes in canister priorities.
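As a rough illustration of the idea (a toy model, not the replica's scheduler), the sketch below recomputes the schedule at the top of every inner iteration, so a priority change made in one iteration affects who runs in the next. The `Canister` type, the `schedule` and `round` functions, and the "running lowers priority by 10" rule are all assumptions made for the example.

```rust
use std::cmp::Reverse;

// Hypothetical toy canister: not the real CanisterState.
#[derive(Clone, Debug)]
struct Canister {
    id: u32,
    priority: i64,
    pending: u32,
}

// Order canister indices by descending priority, ties broken by id.
fn schedule(canisters: &[Canister]) -> Vec<usize> {
    let mut order: Vec<usize> = (0..canisters.len()).collect();
    order.sort_by_key(|&i| (Reverse(canisters[i].priority), canisters[i].id));
    order
}

// One round: scheduling happens inside the inner loop, once per iteration.
// Returns the ids executed in each inner iteration.
fn round(canisters: &mut [Canister], max_iterations: usize, cores: usize) -> Vec<Vec<u32>> {
    let mut executed = Vec::new();
    for _ in 0..max_iterations {
        // Re-schedule using the priorities left by the previous iteration.
        let picked: Vec<usize> = schedule(canisters)
            .into_iter()
            .filter(|&i| canisters[i].pending > 0)
            .take(cores)
            .collect();
        if picked.is_empty() {
            break; // nothing left to execute in this round
        }
        for &i in &picked {
            canisters[i].pending -= 1;
            canisters[i].priority -= 10; // assumed toy rule: running lowers priority
        }
        executed.push(picked.iter().map(|&i| canisters[i].id).collect());
    }
    executed
}
```

With a schedule computed only once per round, the highest-priority canister in this toy model would run in every iteration; rescheduling per iteration lets the others take turns.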

rs/execution_environment/src/scheduler.rs

rs/execution_environment/src/scheduler/tests.rs

alin-at-dfinity · 2024-10-02T07:15:57Z

rs/execution_environment/src/scheduler.rs

@@ -626,9 +626,9 @@ impl SchedulerImpl {
    #[allow(clippy::too_many_arguments, clippy::type_complexity)]
    fn inner_round<'a>(
        &'a self,
+        round_log: &ReplicaLogger,


Personal preference: My approach to argument ordering (and something that, accidentally or not, I see reflected in a lot of places) is:

at a high level, start with input arguments; then output arguments; and finally incidental stuff, such as logs and metrics;

within each of these groups (or more likely within the first) go by importance (e.g. first the state you are modifying, then the round, etc.; in this case, beyond the state it's all a bit subjective).

IOW, I would add the log at the very end (or, if the metrics come last, just before them).
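To make the suggested ordering concrete, here is a minimal sketch of a signature laid out that way. The types `State`, `Logger` and `Metrics` and the function `inner_round_sketch` are hypothetical stand-ins, not the actual `SchedulerImpl::inner_round` signature.

```rust
// Hypothetical stand-ins; these are not the real replica types.
struct State {
    executed: u64,
}
struct Logger {
    lines: Vec<String>,
}
struct Metrics {
    rounds: u64,
}

// Inputs first (most important first), then outputs, then incidental
// arguments such as logs and metrics.
fn inner_round_sketch(
    state: &mut State,           // input: the state being modified
    round: u64,                  // input: the current round number
    executed_out: &mut Vec<u64>, // output argument
    log: &mut Logger,            // incidental: logging
    metrics: &mut Metrics,       // incidental: metrics, last
) {
    state.executed += 1;
    executed_out.push(round);
    log.lines.push(format!("round {} done", round));
    metrics.rounds += 1;
}
```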

alin-at-dfinity · 2024-10-02T07:41:40Z

rs/execution_environment/src/scheduler.rs

-                    self.rate_limiting_of_heap_delta,
+            let mut canisters = state.take_canister_states();
+
+            // Scheduling.


High-level comment: Does this change help move us closer to the quick fix? (I.e. does it make it easier to charge all canisters that got a chance at a full round or not?)

Because OTOH, we have both subnets that spend 15-20 ms scheduling; and subnets that do 12 inner loop iterations per round. Luckily no subnet happens to be in both those groups, but even assuming no increase in these numbers, there's nothing stopping a subnet from doing 250 ms worth of scheduling out of a 400 ms round.
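The worst case above can be checked with simple arithmetic. The sketch below assumes a hypothetical subnet that hits both figures at once (12 inner iterations per round and 15-20 ms of scheduling per iteration), which, as noted, no subnet does today. The function names are illustrative only.

```rust
// Total scheduling time per round if scheduling runs once per inner iteration.
fn scheduling_ms_per_round(inner_iterations: u32, scheduling_ms: u32) -> u32 {
    inner_iterations * scheduling_ms
}

// Share of the round spent scheduling, in whole percent (integer division).
fn share_percent(scheduling_ms: u32, round_ms: u32) -> u32 {
    scheduling_ms * 100 / round_ms
}
```

Under these assumptions, 12 iterations at 15-20 ms each give 180-240 ms of scheduling, i.e. 45-60% of a 400 ms round, which is in the same range as the roughly 250 ms mentioned above.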

berestovskyy · 2024-10-02

This change shifts the schedule by four canisters per inner round, i.e. potentially 12 times faster. I'm aware of the potential performance impact but have prioritized other optimizations for the upcoming release...

alin-at-dfinity · 2024-10-02T07:47:57Z

rs/execution_environment/src/scheduler/tests.rs

+
+        let metrics = &system_state.canister_metrics;
+        // The inner round was skipped once before breaking the round.
+        assert_eq!(metrics.skipped_round_due_to_no_messages, 1);


Low priority: Could we avoid unnecessarily bumping this counter on every round when we execute all messages?

Or rename it to something like "rounds when we executed all messages"? (Although this can probably be inferred from the number of instructions executed in that round.)

berestovskyy · 2024-10-02

Sure, we'll have performance optimizations in https://dfinity.atlassian.net/browse/EXC-1617. It's an orthogonal change to this MR.

github-actions bot added the feat label Sep 30, 2024

berestovskyy force-pushed the andriy/exc-1735-shedule-inner-round branch 2 times, most recently from 0a2a8f3 to f0ea8a7 Compare September 30, 2024 20:22

feat: EXC-1735: Move scheduling into the inner round

d4cd725

By moving the scheduling strategy into the inner round, we can adjust canister priorities within each round. This allows for greater flexibility and responsiveness to changes in canister priorities.

berestovskyy force-pushed the andriy/exc-1735-shedule-inner-round branch from f0ea8a7 to d4cd725 Compare September 30, 2024 20:24

berestovskyy changed the title ~~feat: EXC-1735: Move scheduling strategy into the inner round~~ feat: EXC-1735: Move scheduling into the inner round Sep 30, 2024

berestovskyy marked this pull request as ready for review October 1, 2024 07:08

berestovskyy requested a review from a team as a code owner October 1, 2024 07:08

github-actions bot added the @execution label Oct 1, 2024

berestovskyy requested a review from alin-at-dfinity October 1, 2024 07:09

dsarlis reviewed Oct 1, 2024

rs/execution_environment/src/scheduler.rs (resolved)

rs/execution_environment/src/scheduler.rs (outdated, resolved)

rs/execution_environment/src/scheduler/tests.rs (resolved)

Post-review fixes

d14b746

berestovskyy commented Oct 1, 2024

rs/execution_environment/src/scheduler/tests.rs (resolved)

Remove long execution filtering logic

a910436

alin-at-dfinity reviewed Oct 2, 2024

berestovskyy marked this pull request as draft October 2, 2024 10:59

berestovskyy closed this Oct 3, 2024
